Topic Scaling: A Joint Document Scaling–Topic Model Approach to Learn Time-Specific Topics
نویسندگان
چکیده
This paper proposes a new methodology to study sequential corpora by implementing two-stage algorithm that learns time-based topics with respect scale of document positions and introduces the concept Topic Scaling, which ranks learned within same scale. The first stage documents using Wordfish, Poisson-based document-scaling method, estimate serve, in second stage, as dependent variable learn relevant via supervised Latent Dirichlet Allocation. novelty brings two innovations text mining it explains positions, whose is latent variable, inferred on match their occurrences corpus track evolution. Tested U.S. State Of Union two-party addresses, this inductive approach reveals each party dominates one end interchangeable transitions follow parties’ term office, while shows for German economic forecasting reports shift narrative style adopted institutions following 2008 financial crisis. Besides demonstrated high accuracy predicting in-sample from topic scores, method unfolds further hidden differentiate similar increasing number expand potential nested hierarchical structures. Compared other popular models, Scaling similarities without specifying time frequency evolution, thus capturing broader patterns than dynamic models yielding more interpretable outputs plain
منابع مشابه
Finding document topics for improving topic segmentation
Topic segmentation and identification are often tackled as separate problems whereas they are both part of topic analysis. In this article, we study how topic identification can help to improve a topic segmenter based on word reiteration. We first present an unsupervised method for discovering the topics of a text. Then, we detail how these topics are used by segmentation for finding topical si...
متن کاملAssociation between prostate specific antigen change over time and prostate cancer recurrence risk: a joint model
Background: Prostate specific antigen (PSA) is an important biomarker to monitor patients after treated with radiation therapy (RT). The aim of this study is to evaluate the relationship between the PSA data and prostate cancer recurrence using the joint modeling. Methods: This historical cohort study was performed on 422 prostate cancer patients. Inclusion criteria included: patients with loc...
متن کاملAn Automatic Approach for Document-level Topic Model Evaluation
Topic models jointly learn topics and document-level topic distribution. Extrinsic evaluation of topic models tends to focus exclusively on topic-level evaluation, e.g. by assessing the coherence of topics. We demonstrate that there can be large discrepancies between topicand documentlevel model quality, and that basing model evaluation on topic-level analysis can be highly misleading. We propo...
متن کاملA Dynamic Topic Model for Document Segmentation
Factor language models, like Latent Semantic Analysis, represent documents as mixtures of topics, and have a variety of applications. Normally, the mixture is computed at the whole-document level, that is, the entire document contains material on several topics, without specifying where they occur in the document. In this paper, we describe a new model which computes the topic mixture estimate ...
متن کاملA model to learn from mistakes (using TOPSIS)
Learning is a new paradigm that is introduced in today's organizations. The purpose of this study is to determine learning contingency model from the mistakes using TOPSIS. Variables were identified using the Delphi technique. The samples of this study were consisted 30 public administrators in Iran. The results show which factors have the greatest impact and which causes the least impact on le...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Algorithms
سال: 2022
ISSN: ['1999-4893']
DOI: https://doi.org/10.3390/a15110430